55 research outputs found
An Ensemble-based Approach to Click-Through Rate Prediction for Promoted Listings at Etsy
Etsy is a global marketplace where people across the world connect to make,
buy and sell unique goods. Sellers at Etsy can promote their product listings
via advertising campaigns similar to traditional sponsored search ads.
Click-Through Rate (CTR) prediction is an integral part of online search
advertising systems where it is utilized as an input to auctions which
determine the final ranking of promoted listings to a particular user for each
query. In this paper, we provide a holistic view of Etsy's promoted listings'
CTR prediction system and propose an ensemble learning approach which is based
on historical or behavioral signals for older listings as well as content-based
features for new listings. We obtain representations from texts and images by
utilizing state-of-the-art deep learning techniques and employ multimodal
learning to combine these different signals. We compare the system to
non-trivial baselines on a large-scale real world dataset from Etsy,
demonstrating the effectiveness of the model and strong correlations between
offline experiments and online performance. The paper is also the first
technical overview to this kind of product in e-commerce context
Remote Work Optimization with Robust Multi-channel Graph Neural Networks
The spread of COVID-19 leads to the global shutdown of many corporate
offices, and encourages companies to open more opportunities that allow
employees to work from a remote location. As the workplace type expands from
onsite offices to remote areas, an emerging challenge for an online hiring
marketplace is how these remote opportunities and user intentions to work
remotely can be modeled and matched without prior information. Despite the
unprecedented amount of remote jobs posted amid COVID-19, there is no existing
approach that can be directly applied.
Introducing a brand new workplace type naturally leads to the cold-start
problem, which is particularly more common for less active job seekers. It is
challenging, if not impossible, to onboard a new workplace type for any
predictive model if existing information sources can provide little information
related to a new category of jobs, including data from resumes and job
descriptions. Hence, in this work, we aim to propose a principled approach that
jointly models the remoteness of job seekers and job opportunities with limited
information, which also suffices the needs of web-scale applications. Existing
research on the emerging type of remote workplace mainly focuses on qualitative
studies, and classic predictive modeling approaches are inapplicable
considering the problem of cold-start and information scarcity. We precisely
try to close this gap with a novel graph neural architecture. Extensive
experiments on large-scale data from real-world applications have been
conducted to validate the superiority of the proposed approach over competitive
baselines. The improvement may translate to more rapid onboarding of the new
workplace type that can benefit job seekers who are interested in working
remotely
A Privacy-Preserving Hybrid Federated Learning Framework for Financial Crime Detection
The recent decade witnessed a surge of increase in financial crimes across
the public and private sectors, with an average cost of scams of $102m to
financial institutions in 2022. Developing a mechanism for battling financial
crimes is an impending task that requires in-depth collaboration from multiple
institutions, and yet such collaboration imposed significant technical
challenges due to the privacy and security requirements of distributed
financial data. For example, consider the modern payment network systems, which
can generate millions of transactions per day across a large number of global
institutions. Training a detection model of fraudulent transactions requires
not only secured transactions but also the private account activities of those
involved in each transaction from corresponding bank systems. The distributed
nature of both samples and features prevents most existing learning systems
from being directly adopted to handle the data mining task. In this paper, we
collectively address these challenges by proposing a hybrid federated learning
system that offers secure and privacy-aware learning and inference for
financial crime detection. We conduct extensive empirical studies to evaluate
the proposed framework's detection performance and privacy-protection
capability, evaluating its robustness against common malicious attacks of
collaborative learning. We release our source code at
https://github.com/illidanlab/HyFL .Comment: PETs prize challenge versio
Redefining cardiac biomarkers in predicting mortality and adverse outcomes of inpatients with COVID-19
The prognostic power of circulating cardiac biomarkers, their utility and pattern of release in coronavirus disease 2019 (COVID-19) patients have not been clearly defined. In this multi-centered retrospective study, we enrolled 3,219 patients with diagnosed COVID-19 admitted to 9 hospitals from December 31, 2019 to March 4, 2020, to estimate the associations and prognostic power of circulating cardiac injury markers with the poor outcomes of COVID-19. In the mixed-effect Cox model, after adjusting for age, gender and comorbidities, the adjusted hazard ratios of 28-day mortality for high-sensitivity cardiac troponin I (hs-cTnI) was 7.12 (95%CI, 4.60-11.03; P<0.001), NT-proB-type natriuretic peptide (NT-proBNP) was 5.11 (95%CI, 3.50-7.47; P<0.001), CK-MB was 4.86 (95%CI, 3.33-7.09; P<0.001), myoglobin was 4.50 (95%CI, 3.18-6.36; P < 0.001), and CK was 3.56 (95%CI, 2.53-5.02; P < 0.001). The cutoffs of those cardiac biomarkers for effective prognosis of 28-day mortality of COVID-19 were found to be much lower than for regular heart disease at about 49% of the currently recommended thresholds. Patients with elevated cardiac injury markers above the newly established cutoffs were associated with significantly increased risk of COVID-19 death. In conclusion, cardiac biomarker elevations are significantly associated with 28-day death in patients with COVID-19. The prognostic cutoffs for of these values might be much lower than the current reference standards. These findings can assist better management of COVID-19 patients to improve outcomes. Importantly, the newly established cutoff levels of COVID-19 associated cardiac biomarkers may serve as useful criteria for the future prospective studies and clinical trials
- …